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SEQUENCE LISTING - WEE virus strain 71V-1658 - SEQ ID NO 1 

. LOCUS AF214040 11484 bp RNA VRL 27-JAN-2000 

DEFINITION Western equine encephalomyelitis virus strain 71V-1658 
nonstructural polyprotein and structural polypratein genes, 

complete cds . 
ACCESSION AF214040 AF143811 
VERSION AF214040.1 GI: 6760410 

KEYWORDS 

SOURCE Western equine encephalomyelitis virus. 

ORGANISM Western equine encephalomyelitis virus Viruses; ssRNA 
positive-strand viruses, no DNA stage; Togaviridae; Alphavirus . 
REFERENCE 1 (bases 1 to 114 84) 

AUTHORS Netolitzky,D. J. , Schmalt z , F . L . , Parker, M.D., Rayner,G.A. 

Fisher, G.R., Trent, D.W., Bader,D.E. and Nagata,Li.P. 
M TITLE Complete genomic RNA sequence of western equine 

0 encephalitis virus and expression of the structural genes 
p JOURNAL J. Gen. Virol. 81 Ft 1, 151-159 (2000) 

lU MEDLINE 20109043 

y REFERENCE 2 (bases 1 to 11484) 

01 AUTHORS Netolitzky^D.J. , Schmalt z , F . L . , Parker, M.D., Rayner^G.A. 
^ Fisher, G.R., Trent, D.W., Bader,D.E. and Nagata,L.P. 

^ TITLE Direct Submission 

3 JOURNAL Submitted (08-DEC-1999) Medical Countermeasures Section, 

N Defence Research Establishment Suffield, P.O. Box 4000, Stn Main, 

fy Medicine Hat, Alberta TIA 8K6, Canada 

fy FEATURES Location/Qualifiers 

N' source 1. ,11484 

P /organism^ "Western equine encephalomyelitis virus" 

H /strain=" 71V- 1658" 

/db_xref="taxon: 1103 9" 

5'UTR <1..24 

misc_feature 1.. 11484 

/note=" sequence of this entry added to the sequence of 
AF214039, strain CBA87, constitutes the Western equine 
encephalomyelitis virus complete genome" 



CDS 25.. 7428 

/note=" encodes nsPl, nsP2 , nsP3 and nsP4 proteins" 
/ codon_start=l 

/transl_except= (pos : 5581 . . 5583 , aa : OTHER) 
/product = "nonstructural polyprotein" 
/prot ein_id= " AAF2 83 3 9.1" 
/db__xref ="GI : 6760411" 

/ 1 r ans 1 a t i on= " MER I HVDLDADS P YVKS LQRTF PQFE I E ARQVTDNDHANARAF S 

HVATKLI ESEVDRDQVI LD I GSAPVRHAHSNHRYHCI CPMI S AEDPDRLQRYAERLKK 



SDI TDKNI ASKAADLLEVMSTPDAETPSLCMHTDATCRYFGS VAVYQDVYAVHAPTS I 



YHQALKGWT I YW I GFDTTPFMYKlSnytAGS YPTYNTNWADERVLEARNI GLGN^ 
RLGKLSILRKKRLQPTNKIIFSVGSTIYTEDRSLLRSWHLPNVFHLKGKSNFTGRCGT 



I VSCEGYVI KKI TI SPGLYGKVENLASTMHREGFLSCPCVTDTLRGERVSFAVCTYVPA 



TLCDQMTG I LATDVSVDDAQKLLVGLNQRI WNGRTQRNTNTMQNYLLPWAQAFSRW 



AREHRADLDDEKELGVRERTLTMGCCWAFKTQKI TS I YKKPGTQTI KKVPAVFDSFVI 



PRLTSHGLDMGFRRRLKLLLE PTVKPAPAI TMADVEHLRGLQQEAEEVAAAEE I REAL 
H PPLLPEIEKETVEAEVDLIMQEAGAGSVETPRGHIRVTSYPGEEKIGSYAILSPQAVL 
S NSEKLAC I HPLAEQVLVMTHKGRAGRYKVE PYHGKVI VPEGTAVPVQDFQALSES ATI 

ru 

UJ VFNEREFVNRYLHH I AINGGALNTDEE Y YKTVKTQDTDS E YVFD I DARKC VKREDAGP 

m 

^ LCLTGDLVDPPFHEFAYESLKTRPAAPHKVPTIGVYGVPGSGKSGIIKSAVTKKDLW 
3 SAKKENCAE 1 1 RDWRMRRMDVAARTVDS VLLNGVKHPVNTLYIDEAFACHAGTLLAL 

H 

fy lAI VKPKKWLCGDPKQCGFFNMMCLKVHFNHDI CTEVYHKS I SRRCTQTVTAI VSTL 

m 

FYDKRMKTVNPCADKI I IDTTGTTKPHKDDLILTCFRGWVKQLQIDYKNHEIMTAAAS 

□ 

QGLTRKGVYAWYKVNENPLYSQTSEHVNVLLTRTEKRIWKTIiAGDPWIKTLTAKYP 



GDFTASLDDWQREHDAIMARVLDKPQTADVFQNKVNVCWAKALEPVIATANIVLTRQQ 



WETLHPFKHDRAYSPEMALNFFCTRFFGVDIiDSGLFSAPTVALTYRDQHWDNSPGKNM 



YGLNREVAKELSRRYPC I TKAVDTGRVAD I RNNT I KD YS PT I NWPLNRRLPHSL I VD 



HKGQGTTDHSGFLSKMKGKSVLVIGDPI S I PGKKVESMGPLPTNTIRCDLDLGI PSHV 



GKYDI I FVNVRTPYRNHHYQQCEDHAIHHSMLTCKAVHHLNTGGTCVAIGYGLADRAT 



ENI I TAVARSFRFTRVCQPKMTAENTEVLFVFFGKDNGNHTHDQDRLGWLDNI YQGS 



TRYEAGRAPAYRVI RGD I SKS ADQAI VNAANSKGQPGSGVCGALYRKWPAAFDRQP I A 



VGTARLVKHEPLI IHAVGPNFSKMPEPEGDLKIiAAAYMS I AS IVNAERITKI SVPLLS 



TG I YSGGKDRVMQS LHHLFTAFDTTDADVT I YCLDKQWETRI I EAI HRKE S VE I LDDD 



KPVD I DLVRVHPNS SliAGRPG YS VNEGKIiYS YLEGTRFHQTAKD I AE I HAMWPNKS EA 



NEQICLYILGESMSSIRSKCPVEESEASAPPHTLPCLCNYAMTAERVYRLRSAKKEQF 

AVCSSFLLPKYRITGVQKIiQCSKPVLFSGWPPAVHPRKYAEIILETPPPPATTTVIC 

EPTVPERI PSPVI SRAPSAESLLSLGGVSFSSSATRSSTAWSDYDRRFWTADVHQAN 

TSTWS I PSAPGLDVQLPSDVTDSHWS I PSASGFEVRTPSVQDLTAECAKPRGLAEIMQ 

DFNTAPFQFLSDYRPVPAPRRRPI PS PRSTASAPPVPKPRRTKYQQPPGVARAI SEAE 

LDEYIRQHSNXRYEAGAYIFSSETGQGHLQQKSVRQCKLQEPILDRAVHEKYYAPRLD 

LEREKMLQKKLQLCASEGNRSRYQSRKVENMKAITAERLiISGLGTYLSSEVNPVECYR 

VNYPVPIYSSTVINRFTSAEVAVKTCNLVIQEJSTYPTVASYCITDEYDAYLDMVDGASC 

CLDTATFCPAKLRS YPKKHS YLQPE I RS AVPS P I QNTLQNVLAAATKRNClSrVTQMRE 

PVLDSAAFNVDCFKKYACNDEYWDTFRDNPIRLTTENVTQYVTKLKGPKAAALF^ 

NLKPLQEIPMDQFVMDLKRDVKVTPGTKHTEERPKVQVIQAADPIiATAYLCGIHREL^ 

RRLNAVLLPNIHTLFDMSAEDFDAI I AEHFHHGDPVLETDIASFDKSEDDAI AI SALM 

ILEDLGVDQPLLDLIEAAFGNITSVHLPTGTRFKFGAMMKSGMFLTLFVNTLVNIMIA 

SRVLRERLTTSACAASIGDDNIVHGWSDTLMAERCATWLIvny^ 

CGGF I LVDQ I TGTACRVADPLKRLFKLGKPLP VDDTQDCDRRRALHDEAMRWNR I G I T 

DEIiVKAVESRYE 1 1 LAGIil ITSLSTLAES VKNFKS I RGS P I TLYG " 

CDS 7473 . . 11183 

/note=" encodes nucleocapsid, E3, E2 , 6K and El proteins" 
/codon_start=l 

/product=" structural polyprotein" 
/protein_id= "AAF2 834 0 . 1 " 
/db_xref="GI : 6760412" 

/translation="MFPYPQLNFPPVTPTNPMAYRDPNPPRRRWRPFRPPLiAAQIEDL 

RRSIWLTFKQRSPNPPPGPPPKKKKSAPKPKPTQPKKKKQQAKRTKRKPKPGKRQRM 

CMKliESDKTFPIMLNGQWGYACVVGGRLMKPLHVEGKIDNEQLAAVKLKK^ 

YGDVPQNMKSDTLQYTSDKPPGFYNWHHGAVQYENGRFTVPRGVGGKGDSGRPILDNR 

GRWAIVLGGANEGTRTALSWTWNQKGVTIRDTPEGSEPWSLVTALCVLSNVTFPCD 
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KPPVCYSLTPERTLDVLEENVDNPNYDTLLENVLKCPSRRPKRSITDDFTLTSPYLGF 

CP YCRHSTPCF S P I KI ENVWDE SDDGS I R I QVS AQFGYNQAGTADVTKFRYMS FDHDH 

DIKEDSMEKIAISTSGPCRRLGHKGYFLLAQCPPGDSVTVSITSGASENSCTVEKKIR 

RKFVGREEYLFPPVHGKLVKCHVYDHLKETSAGYITMHRPGPHAYKSYLEEASGEVYI 

KPPSGKWTYECKCGDYSTGIVSTRTKMNGCTKAKQCIAYKSDQTKWVFNSPDLIRHT 

DHSVQGKLHI PFRLTPTVCPVPLAHTPTVTKWFKGI TLHIiTAMRPTLLTTRKLGLRAD 

ATAEWITGSTSRNFSVGREGLEYVWGNHEPYRVWAQESAPGDPHGWPHEI I IHYYHRH 

PVYTVIVLCGVAIxAILVGTASSAACIAKTU^RDCLTPYALAPNATVPTAIiAVLCCIR 

NAETFGETLNHLWFNNQPFLWAQLCIPLAALVILFRCFSCCMPFLLVAGVCLGKVDAF 

EHATTVPNVPGIPYKALVERAGYAPLNLEITWSSELTPSTNKEYVTCKFHTVIPSPQ 

VKCCGSLECKASSKADYTCRVFGGVYPFMWGGAQCFCDSENTQLSEAYVEFAPDCTID 

HAVALKVHTAAIiKVGLRIVYGNTTAHLDTFVNGVTPGSSRDLKVIAGPISAAFSPFDH 

KWIRKGLVYNYDFPEYGAMKPGAFGDIQASSLDATDIVARTDIRLLKPSVKNIHVPY 

TQAVSGYEMWKNNSGRPLQETAPFGCKIEVEPLRASNCAYGHI PI S IDI PDAAFVRSS 

ES PT I LEVS C WADC I YSADFGGSLTLQYKADREGHCPVHSHSTTAVLKEATTHVTAV 

GS I TLHFSTS S PQANF I VSLCGKKSTCNAECKPPADH 1 1 GEPHKVDQEFQAAVS KTS W 

NWLLALFGGASSLIWGLIVLVCSSMLINTRR" 

3 'UTR 11184 . . 11484 



BASE COUNT 3251 a 2842 c 2808 g 2583 t 

ORIGIN 

1 accctacaaa ctaatcgatc caatatggaa agaattcacg ttgacttaga tgctgacagc 
61 ccgtatgtca agtcgttaca gcggacgttt ccacaatttg agatcgaagc aaggcaggtc 
121 actgacaatg accatgccaa tgccagagcg ttttcgcatg tggcaacaaa gctcattgag 
181 agcgaagtcg accgggacca agttatcttg gacattggaa gtgcgcccgt cagacatgca 
241 cattccaatc accgctatca ttgtatctgc cctatgataa gcgctgaaga cccggacaga 
301 ctacaacggt atgcagaaag acttaagaaa agtgacatta ccgacaagaa catagcctct 
3 61 aaggcggcag acctgctgga agtcatgtca acaccagacg cagagactcc atctctgtgt 
421 atgcacacag acgccacgtg taggtacttt ggaagtgtag cagtatacca agatgtgtac 
481 gcagtccatg caccgacatc aatctaccac caggcgctta aaggagttag gacaatttac 
541 tggataggct ttgacacgac cccttttatg tacaaaaaca tggcaggttc ctaccctact 



601 tacaacacga actgggctga cgagagagta ttggaagcac gtaacattgg cctcggtaac 
661 tcagatcttc aggagagcag gcttggaaaa ctctcaatcc ttaggaagaa gaggctccaa 
721 cctactaata agatcatatt ctcggttggt tcaacaatct acacagaaga tagatcactg 
781 ttacgtagct ggcatcttcc aaacgtgttc cacttgaaag gaaagtctaa cttcacaggt 
841 agatgtggga ccattgtcag ctgtgaaggg tacgtcatca aaaagataac gatcagccca 
901 ggactatacg gtaaagttga gaacttggcg tccacaatgc atcgcgaggg tttcttgagt 
961 tgcaaagtca cagatacgct gcgcggcgag agggtttctt ttgctgtgtg tacgtatgta 
1021 ccagccacac tttgcgatca gatgacaggg attctggcaa ctgacgttag tgtggatgac 
1081 gcacaaaaac tattggttgg gctcaaccaa aggattgtcg tcaatggtag gacgcaaaga 
1141 aatactaaca caatgcagaa ctatctatta ccagtggtcg cccaggcgtt ttccaggtgg 
12 01 gcgcgtgaac atcgtgccga cttggacgac gagaaagaac taggggtgcg ggagcgcact 
1261 cttactatgg gctgctgctg ggctttcaag acccagaaaa tcacatccat ctacaagaag 
1321 cctggtacgc aaacaattaa gaaagtacct gccgtctttg actcatttgt gattccacgc 
1381 cttaccagcc acgggctcga tatgggcttc cgccgtaggc tcaagctgct gcttgaacca 
1441 actgtcaaac ccgcaccggc tattacaatg gccgatgtgg agcatctgcg tggcttacag 
1501 caagaagctg aagaagtggc tgcagcggaa gagatcagag aagccctgcc acccttgctc 
1561 cctgaaatag aaaaagagac cgtagaggca gaagtagacc tcattatgca agaggcagga 
1621 gcaggtagcg tggagacacc acgaggacac atcagggtga caagttaccc aggcgaagag 
1681 aagattgggt cttacgctat actttcaccc caggcggtat tgaatagtga aaaactggcg 
1741 tgtatccacc cattggcgga acaagtactg gtaatgactc acaaaggtag ggcagggaga 
1801 tacaaagtcg agccatacca cggtaaggtc attgtaccag aagggacggc ggtccctgtt 
1861 caagacttcc aggcattgag tgagagcgct acgatcgttt tcaacgagag ggagttcgta 
1921 aacagatacc tgcaccacat cgcaatcaac ggaggagcgc taaacactga cgaagagtac 
1981 tataagactg taaagactca ggacacagac tcagaatacg tcttcgatat tgacgcacga 
2041 aagtgtgtta agcgagaaga cgcaggtccc ttgtgcctaa ccggtgatct ggtagatcca 
2101 ccatttcacg agtttgcgta cgagagtctc aagacacgac cagcagcacc tcacaaagtc 
2161 ccaaccatcg gagtctatgg agtgccaggt tcaggtaaat ctggaatcat caaaagcgct 
2221 gtgactaaga aagatctggt tgtgagtgcg aagaaggaaa actgcgcaga aatcatcagg 
22 81 gatgtaagga ggatgagacg tatggatgtt gctgctagga ctgtcgattc agtgcttcta 
2341 aatggggtta agcaccccgt taacactctg tacattgatg aggcatttgc ctgccatgca 
2401 gggacgctgc tggcactgat tgccatcgtc aaacctaaga aagtggtatt gtgcggggac 
2461 ccaaaacaat gcggcttctt taacatgatg tgcctgaaag tacattttaa ccatgacata 
2 521 tgcactgaag tgtaccataa aagcatctct aggaggtgca cacagactgt aaccgccatc 
2581 gtctccacgc tcttctacga caagcgaatg aagacggtta acccatgtgc tgataaaatc 
2 641 atcatagata ccacagggac cacaaagccg cacaaagatg atctgattct aacctgtttc 
2 701 agaggatggg tgaaacagct acagattgac tacaaaaatc acgaaatcat gactgcggct 
2761 gcatcgcaag gacttacgcg gaaaggcgtt tatgctgtca ggtacaaagt caacgagaat 
2 821 ccactctact cgcagacttc tgagcacgtg aacgtgttac ttacacgcac agaaaaacgc 
2881 attgtctgga agacgctagc tggtgatccc tggataaaga cacttacagc taaatatccc 
2 941 ggggatttca cggcttcatt ggacgactgg cagcgcgaac acgacgccat tatggcacgc 
3001 gttcttgata agccgcagac agctgatgtg ttccagaata aggtgaacgt ctgctgggcg 
3061 aaggctttag agccagtctt ggccacggcc aacattgtgc tgacgagaca gcagtgggag 
3121 acgttgcacc cattcaagca tgacagagcg tactcacctg aaatggcact gaacttcttt 
3181 tgcaccaggt tctttggagt agacctggac agtgggttat tttccgctcc taccgtcgca 
3241 cttacttaca gggatcagca ctgggataac tcgccaggga agaacatgta tgggcttaat 
33 01 agagaggtag caaaggagtt gtcacggcga tatccgtgca tcacaaaagc ggttgacaca 
3361 ggcagggtag ctgatataag gaataatacc atcaaggact actctccaac aattaatgtg 
3421 gttccattaa atcgccggtt gccccactcg ttgatcgttg accacaaagg acagggtaca 
3481 actgatcaca gcggattcct atctaagatg aagggcaaat ctgtgttggt gatcggcgat 
3541 cctatcagca ttccagggaa gaaagtagag tccatgggtc cattgcccac taataccatc 
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3601 aggtgtgatc tcgatttggg aatacctagc catgtcggta aatatgacat tatctttgtc 

3661 aatgttagga ccccgtacag gaaccatcac taccaacagt gcgaggatca cgctatccac 

3721 cacagcatgc taacgtgtaa ggctgtccac cacctgaaca ctggcggaac atgtgtggct 

3781 atagggtatg ggcttgctga tcgcgcaacc gagaatatca tcactgcggt ggcacgctca 

3841 tttaggttta cccgtgtctg tcagcctaag aacactgccg aaaatactga ggttctcttc 

3901 gtgttcttcg gcaaggacaa cggcaaccac acacatgacc aggacagact cggtgtagtg 

3 961 cttgacaaca tctatcaagg gtcaaccagg tacgaggcag ggagagctcc agcgtacaga 

4 021 gtgatcagag gtgacattag caagagcgct gaccaagcta tcgttaatgc tgctaatagc 
4081 aaaggtcaac caggttccgg agtgtgcggt gcactgtacc gaaaatggcc ggctgctttt 
4141 gatagacagc caatagctgt cgggacggct agacttgtga agcacgaacc gctcatcata 
42 01 catgctgtag gacccaattt ttctaagatg ccggaaccgg agggcgacct taagctcgca 
42 61 gctgcctaca tgagcatagc gtccatcgtc aacgctgagc ggattacaaa aatatcagta 
4321 ccgctactgt caaccggcat ctattctggt ggcaaagatc gagtgatgca atcattgcat 
4381 cacctgttca ctgctttcga cactacggat gccgatgtca ccatatattg cttggataaa 

1^ 4441 caatgggaga ccaggataat cgaggccatt caccgcaaag aaagcgtcga aattctggat 

^ 4501 gatgacaagc cagtagacat tgacttggtc agggtccacc caaacagctc tttggcaggc 

Fi 4561 agaccaggtt actccgtcaa tgagggcaag ttgtattcat acctggaagg tacacgattc 

^ 4621 catcagaccg ccaaggacat tgccgaaatc catgcaatgt ggcccaacaa atctgaggct 

yi 4681 aatgagcaga tttgcttgta catcctgggg gagagtatgt ccagcatccg ctccaaatgc 

m 4741 ccagtagagg agtcagaggc gtctgctcca cctcacacac ttccatgcct gtgtaattac 

2 4801 gctatgacgg ctgagcgcgt atacaggttg cgctctgcga agaaagaaca gttcgccgta 

^ 4 8 61 tgctcatcat tcctgttgcc gaagtacagg atcacaggcg tgcagaagct acagtgcagc 

4 921 aaaccagtcc tgttttcagg cgtcgtacca ccggctgtac accccaggaa gtacgcggaa 

^ 4981 ataattctag aaacgccacc accgccagca acgacaaccg taatatgtga accaactgtg 

5041 ccagaacgta tacccagtcc ggtgatttct agagcaccaa gtgcggaatc actgctatcg 

ry 5101 cttggcggcg tctcgttctc tagctctgcc acacgctcgt caaccgcctg gagcgactat 

'y, 5161 gacaggcggt ttgtggttac agctgatgtg catcaagcga acacgtctac gtggagcatc 

S 5221 cctagtgctc ctggcttgga cgtccagctg ccttctgacg tcactgattc ccactggagt 

£ 5281 attccaagtg catcaggctt tgaagtgaga acaccatctg tacaggacct aactgcggag 

5341 tgtgcgaagc ctcgtggact ggccgaaata atgcaagact tcaatactgc tcctttccag 

54 01 tttctttcgg actacagacc agtaccggca ccacggagac gccccatccc atcacctaga 

5461 tcgacggctt ccgcacctcc agttccaaag ccacgcagga ctaagtacca acaaccacca 

5521 ggagtcgcta gagcgatctc agaagcggag ttggacgagt acatccgtca acactccaac 

5581 tgacggtatg aagcgggagc gtatattttc tcatcggaaa caggccaagg tcaccttcaa 

5641 cagaaatcag tacgtcaatg taaactacaa gaacctatat tggatcgggc cgtccatgag 

5701 aagtattacg ccccgcgcct cgatctcgaa agagagaaaa tgttacagaa gaaactgcaa 

5761 ttatgcgcct ctgaaggaaa tagaagcagg tatcaatcac gaaaagtaga aaatatgaaa 

5821 gcaattacag cggagcgact catttctgga ttgggcacat atctatcatc agaagtgaat 

5881 cctgtcgagt gttacagagt caattatcct gtaccaatct actcgtcaac ggtaattaac 

5941 aggtttacat ctgcagaggt cgcggttaaa acgtgcaact tagttatcca agagaattac 

6001 cctacagtag ccagttattg tataacagat gaatacgatg cgtatcttga catggtggac 

6061 ggcgcatcgt gctgtctaga tacagccact ttttgtccgg ctaaactgag aagctaccca 

6121 aagaagcata gctatttgca gccagagata agatcagccg tcccatcgcc tatacagaat 

6181 acattacaaa atgtattggc tgcagctact aaaaggaatt gcaacgttac ccaaatgcga 

6241 gaattacctg tcttagattc ggcggcattt aatgttgatt gtttcaagaa atacgcatgc 

6301 aatgatgagt actgggatac ctttcgcgat aaccctattc ggctaactac agagaacgtt 

63 61 acgcaatatg tgacaaagct gaaagggccg aaagcagcag cattgtttgc gaatactcat 

6421 aatctaaaac cgttgcagga gataccaatg gatcaattcg tcatggatct aaagagagat 

6481 gtcaaagtta ctcccggcac gaaacataca gaggagcggc ctaaggtgca ggttattcag 

6541 gctgcagatc cccttgctac cgcttacctt tgcgggatcc atcgggaatt agtccgtaga 
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6601 ctgaatgcgg tgcttctgcc aaatatccat actctcttcg acatgtcagc ggaagatttt 

6661 gatgcgatta ttgctgaaca tttccaccac ggcgacccag tattggaaac ggacatcgcg 

6721 tcgtttgata aaagcgaaga cgacgctatc gccatttcgg cgttgatgat ccttgaggac 

6781 ttaggtgtcg accaaccgct cttagatttg atagaggcgg cgttcggcaa tatcacatct 

6841 gtgcacctac ctacaggaac gaggtttaaa tttggtgcca tgatgaaatc cggtatgttc 

6901 ttaacgctgt ttgtcaacac actagtcaat atcatgattg ctagcagagt actacgtgaa 

6961 cggttaacca cgtcagcgtg cgcggcctct atcggcgacg ataacatagt gcatggtgtc 

7021 gtctccgaca ccttgatggc ggagagatgc gccacttggc tgaacatgga agtaaaaatt 

7081 attgatgcag ttattggtat caaagcaccc tacttctgtg ggggatttat cctggtggac 

7141 cagataacag gcacagcctg cagagtcgca gaccctctaa aaaggctttt taagcttgga 

72 01 aaaccattgc cagtcgatga tacccaagac tgcgaccgcc gccgggcact gcatgatgaa 

72 61 gcaatgcgat ggaacagaat tggaattacg gacgagttag tgaaggccgt agaatccaga 

73 21 tacgagatca tactggcagg cctgatcatc acgtctctgt ccacgttagc cgaaagcgtt 
7381 aagaacttca agagcataag agggagccca atcaccctct acggctgacc taaataggtg 
7441 acgtagtaga cacgcaccta cccaccggca gaatgtttcc ataccctcag ctgaactttc 
7501 caccagttta ccctacaaat ccgatggctt accgagatcc aaaccctcct aggcgccgct 

pi 7561 ggaggccgtt tcggcccccg ctggctgctc aaatcgaaga tcttaggagg tcgatagtca 

^ 7621 acttgacttt caaacaacga tcacctaatc cgccgccagg tccaccgcca aagaagaaga 

''^ 7681 agagtgctcc taagccaaaa cctactcagc ctaaaaagaa gaagcagcaa gccaagagga 

7741 cgaaacgcaa gcctaaacca gggaaacgac aacgtatgtg tatgaagttg gagtcggaca 

78 01 agacatttcc gatcatgctg aacggccaag tgaatggata tgcctgcgtt gtcggaggaa 

7861 ggctgatgaa accactccac gttgaaggaa aaattgataa tgagcaatta gcggccgtga 

7 921 aattgaagaa ggctagcatg tacgacttgg agtacggcga cgttccccag aacatgaaat 

7981 cagacacgct gcagtacacc agcgacaaac caccgggctt ctacaactgg caccacggcg 

8041 cagtccagta tgagaatggg agatttaccg taccgagagg agtgggcggg aaaggcgaca 



Pi 



pi 8101 gcggaagacc gatcctggac aacagaggca gagttgtggc tattgttcta ggaggtgcaa 

8161 atgagggcac gcgtacggcg ctttcagtgg tcacttggaa ccagaaaggg gtgaccatta 

f?^ 8221 gggatacccc cgaaggttct gaaccgtggt cactagttac agcgctatgc gtgctttcga 

T1 8281 atgtcacgtt cccatgcgac aaaccacccg tgtgctattc actgacgcca gaacgaacac 

^ 8341 tcgacgtgct cgaagagaac gtcgacaatc caaattacga cacgctgctg gagaacgtct 

8401 tgaaatgtcc atcacgccgg cccaaacgaa gcattaccga tgacttcaca ctgaccagtc 

84 61 cctacctggg gttctgcccg tattgcagac actcaacgcc gtgtttcagc ccaataaaaa 

8521 ttgagaacgt gtgggacgaa tctgatgatg gatcgattag aatccaggtc tcggcacaat 

8581 tcggctacaa tcaggcaggc actgcggatg tcaccaaatt ccgttacatg tctttcgacc 

8641 acgaccatga catcaaggaa gacagtatgg agaaaatagc tatcagcaca tctggaccct 

8701 gccgtcgtct tggccacaaa gggtacttcc tgttagctca atgtcctcca ggtgacagtg 

8761 taaccgtcag tatcacgagc ggagcatctg agaattcatg caccgtggag aaaaagatca 

8821 ggaggaagtt tgtcggtaga gaggagtact tgttcccacc cgtccatgga aagctggtaa 

8881 agtgccacgt ttacgatcac ttgaaggaga cgtctgccgg gtacataacc atgcacaggc 

8 941 caggcccaca cgcgtataag tcctatctgg aggaagcgtc aggcgaagtg tacattaaac 

9001 caccttctgg caagaacgtc acctacgaat gtaagtgtgg cgactacagc acaggtatcg 

9061 tgagcacgcg aacgaagatg aacggctgca ctaaagcaaa acagtgcatt gcctacaaga 

9121 gcgaccaaac gaaatgggtc ttcaactcgc cggatcttat taggcacaca gaccactcag 

9181 tgcaaggtaa attgcacatt ccattccgct tgacaccgac agtctgcccg gttccgttag 

9241 ctcacacgcc tacagtcacg aagtggttca aaggcatcac cctccacctg actgcaatgc 

93 01 gaccaacatt gctgacaacg agaaaattgg ggctgcgagc agacgcaaca gcagaatgga 

93 61 ttacagggtc tacatccagg aatttttctg tggggcgaga agggctggag tacgtatggg 

9421 gtaaccatga accagtcaga gtctgggccc aggagtcggc accaggcgac cpacatggat 

9481 ggccgcatga gatcatcatc cactattatc atcggcatcc agtctacact gtcattgtgc 

9541 tgtgtggtgt cgctcttgct atcctggtag gcactgcatc atcagcagct tgcatcgcca 



9601 aagcaagaag agactgcctg acgccatacg cgcttgcacc gaacgcaacg gtacccacag 
9661 cattagcggt tttgtgctgc attcggccaa ccaacgctga aacatttgga gaaactttga 
9721 accatctgtg gtttaacaac caaccgtttc tctgggcaca gttgtgcatt cctctggcag 
9781 cgcttgttat tctgttccgc tgcttttcat gctgcatgcc ttttttattg gttgcaggcg 
9841 tctgcctggg gaaggtagac gccttcgaac atgcgaccac tgtgccaaat gttccgggga 
9901 tcccgtataa ggcgttggtc gaacgcgcag gttacgcgcc acttaacctg gagatcacgg 
9961 tcgtctcatc ggaattaaca ccttcaacta acaaggagta cgtgacctgc aaattccaca 
10021 cagtcattcc ttcaccacaa gttaaatgct gcgggtccct cgagtgcaag gcatcctcaa 
10081 aggcggatta cacatgccgc gtttttggcg gtgtgtaccc tttcatgtgg ggaggcgcac 
10141 aatgcttctg tgacagtgag aacacacaac tgagtgaggc gtacgtcgag ttcgctccag 
10201 actgcactat agatcacgca gtcgcactaa aagttcacac agctgctctg aaagtcggcc 
10261 tgcgtatagt atacggcaac accaccgcgc acctggatac gtttgtcaat ggcgtcacgc 
10321 caggttcctc acgggacctg aaggtcatag cagggccgat atcagccgct ttttcaccct 
10381 ttgaccataa ggtcgtcatc agaaaggggc ttgtttacaa ctacgacttc cctgagtatg 
10441 gagctatgaa accaggagcg ttcggcgata ttcaagcatc ctcgcttgat gctacagaca 
M, 10501 tagtagcccg cactgacata cggctgctga agccttctgt caagaacatc cacgtcccct 
p 10561 acacccaagc agtatcaggg tatgaaatgt ggaagaacaa ctcaggacga cccctgcaag 
Q 10621 aaacagcacc atttggatgt aaaattgaag tggagcctct gcgagcgtct aactgtgctt 
flj 10681 acgggcacat ccctatctcg attgacatcc ctgatgcagc ttttgtgaga tcatcagaat 
Ijj 10741 caccaacaat tttagaagtt agctgcacag tagcagactg catttattct gcagactttg 
@1 10801 gtggttctct aacattacag tacaaagctg acagggaggg acattgtcca gttcactccc 
M 10861 actccacgac agctgttttg aaggaagcga ccacacatgt gactgccgta ggcagcataa 
kO 10921 cactacattt tagcacatcg agcccacaag caaattttat agtttcgcta tgcggcaaga 
5 10981 agtccacctg caatgctgaa tgtaaaccac cggccgacca cataattgga gaaccacata 
M 11041 aagtcgacca agaattccag gcggcagttt ccaaaacatc ttggaactgg ctgcttgcac 
ry 11101 tgtttggggg agcatcatcc ctcattgttg taggacttat agtgttggtc tgcagctcta 
nj 11161 tgcttataaa cacacgtaga tgactgagcg cggacactga catagcggta aaactcgatg 
M 11221 tacttccgag gaagcgtggt gcataatgcc acgcgccgct tgacactaaa actcgatgta 
O 11281 tttccgagga agcacagtgc ataatgctgt gcagtgtcac attaatcgta tatcacacta 
H" 11341 catattaaca acactatatc acttttatga gactcactat gggtctctaa tatacactac 
11401 acatatttta cttaaaaaca ctatacacac tttataaatt cttttataat ttttcttttg 
11461 tttttatttt gtttttaaaa tttc 



// 




SEQUENCE LISTING - pcDWXH-7 - SEQ ID NO 2 

<210> SEQ ID NO 2 

<211> LENGTH: 4150 

<212> TYPE: DNA 

<213> ORGANISM: WESTERN EQUINE ENCEPHALITIS VIRUS 

<214> FEATURES : 

Location/Qualifiers 

source 1 . . 4150 

/organism^ "Western equine encephalomyelitis virus" 

/strain=" 71V- 165 8" 

vector sequence 1-9 

5' Sacl primer 9-20 

3' end of NS4 gene 16-114 

intragenic region 115-159 

CDS 160-3869 

/note=" encodes capsid, E3 , E2 , 6K and El proteins" 
/codon__start = 16 0 

/product=" structural polyprotein" 
/translation="MFPYPQLNFP P VYPTNPMAYRDPNP PRRRWRPFRPPLAAQ I EDL 
RRSIVNLTFKQRSPNPPPGPPPKKKKSAPKPKPTQPKKKKQQAKRTKRKPKPGKRQRM 
CMKLESDKTFP IMLNGQVNGYACVVGGRLMKPLHVEGKI DNEQIAAVKLKKASMYDLE 
YGDVPQNMKSDTLQYTSDKPPGFYNWHHGAVQYENGRFTVPRGVGGKGDSGRPILDNR 
GRWAIVLGGANEGTRTALSWTWNQKGVTIRDTPEGSEPWSLVTALCVLSNVTFPCD 
KPPVCYSLTPERTLDVLEENVDNPNYDTLLENVLKCPSRRPKRSITDDFTLTSPYLGF 
CPYCRHSTPCFSP I KI ENVWDESDDGS IRIQVSAQFGYNQAGTADVTKFRYMSFDHDH 
DIKEDSMEKIAISTSGPCRRLGHKGYFLLAQCPPGDSVTVSITSGASENSCTVEKKIR 
RKFVGREEYLFPPVHGKLVKCHVYDHLKETSAGYITMHRPGPHAYKSYLEEASGEVYI 
KPPSGKNVTYECKCGDYSTGIVSTRTKMNGCTKAKQCIAYKSDQTKWVFNSPDLIRHT 
DHSVQGKLHI PFRLTPTVCPVPLAHTPTVTKWFKGI TLHLTAMRPTLLTTRKLGLRAD 
ATAEWITGSTSRNFSVGREGLEYVWGNHEPVRVWAQESAPGDPHGWPHEI I IHYYHRH 
P VYT VI VLCGVALA I L VGTAS S AAC I AKARRDCLTP YALAPNATVPTALAVLCC I RPT 
NAETFGETLNHLWFNNQPFLWAQLC I PLAALVI LFRCFSCCMPFLLVAGVCLGKVDAF 
EHATTVPNVPGI PYKALVERAGYAPLNLE ITWSSELTPSTNKEYVTCKFHTVI PSPQ 



VKCCGSLECKASSKADYTCRVFGGVYPFMWGGAQCFCDSENTQIiSEAYVEFAPDCTID 



HAVALKVHTAALKVGLRIVYGNTTAHLDTFVNGVTPGSSRDLKVIAGPISAAFSPFDH 

KWI RKGLVYNYDFPE YGAMKPGAFGD I QAS SLDATD I VARTD I RLLKPS VKNI HVP Y 

TQ AVS G YEMWKNNS GR PLQETAP FGCKI EVE PLRASNCAYGH I P I S I D I PDAAF VRS S 

E S PT I LE VS CTVADC I YS ADFGGS LTLQ YKADREGHC PVHSHSTTAVLKEATTHVTAV 

GSITLHFSTSSPQANFIVSLCGKKSTCNAECKPPADHIIGEPHKVDQEFQAAVSKTSW 

NWLLALFGGASSLIWGLIVLVCSSMLINTRR" 

3'UTR 3 8 70-4150 

BASE COUNT ORIGIN 

1 ggccctctag agctcatact ggcaggcctg atcatcacgt ctctgtccac gttagccgaa 
61 agcgttaaga acttcaagag cataagaggg agcccaatca ccctctacgg ctgacctaaa 
121 taggtgacgt agtagacacg cacctaccca ccggcagaat gtttccatac cctcagctga 
181 actttccacc agtttaccct acaaatccga tggcttaccg agatccaaac cctcctaggc 
241 gccgctggag gccgtttcgg cccccgctgg ctgctcaaat cgaagatctt aggaggtcga 
301 tagtcaactt gactttcaaa caacgatcac ctaatccgcc gccaggtcca ccgccaaaga 

3 61 agaagaagag tgctcctaag ccaaaaccta ctcagcctaa aaagaagaag cagcaagcca 
421 agaggacgaa acgcaagcct aaaccaggga aacgacaacg tatgtgtatg aagttggagt 

4 81 cggacaagac atttccgatc atgctgaacg gccaagtgaa tggatatgcc tgcgttgtcg 
541 gaggaaggct gatgaaacca ctccacgttg aaggaaaaat tgataatgag caattagcgg 
6 01 ccgtgaaatt gaagaaggct agcatgtacg acttggagta cggcgacgtt ccccagaaca 
661 tgaaatcaga cacgctgcag tacaccagcg acaaaccacc gggcttctac aactggcacc 
721 acggcgcagt ccagtatgag aatgggagat ttaccgtacc gagaggagtg ggcgggaaag 
781 gcgacagcgg aagaccgatc ctggacaaca gaggcagagt tgtggctatt gttctaggag 
841 gtgcaaatga gggcacgcgt acggcgcttt cagtggtcac ttggaaccag aaaggggtga 
901 ccattaggga tacccccgaa ggttctgaac cgtggtcact agttacagcg ctatgcgtgc 
961 tttcgaatgt cacgttccca tgcgacaaac cacccgtgtg ctattcactg acgccagaac 
1021 gaacactcga cgtgctcgaa gagaacgtcg acaatccaaa ttacgacacg ctgctggaga 
1081 acgtcttgaa atgtccatca cgccggccca aacgaagcat taccgatgac ttcacactga 
1141 ccagtcccta cctggggttc tgcccgtatt gcagacactc aacgccgtgt ttcagcccaa 
12 01 taaaaattga gaacgtgtgg gacgaatctg atgatggatc gattagaatc caggtctcgg 
1261 cacaattcgg ctacaatcag gcaggcactg cggatgtcac caaattccgt tacatgtctt 
1321 tcgaccacga ccatgacatc aaggaagaca gtatggagaa aatagctatc agcacatctg 
1381 gaccctgccg tcgtcttggc cacaaagggt acttcctgtt agctcaatgt cctccaggtg 
1441 acagtgtaac cgtcagtatc acgagcggag catctgagaa ttcatgcacc gtggagaaaa 
1501 agatcaggag gaagtttgtc ggtagagagg agtacttgtt cccacccgtc catggaaagc 
1561 tggtaaagtg ccacgtttac gatcacttga aggagacgtc tgccgggtac ataaccatgc 
1621 acaggccagg cccacacgcg tataagtcct atctggagga agcgtcaggc gaagtgtaca 
16 81 ttaaaccacc ttctggcaag aacgtcacct acgaatgtaa gtgtggcgac tacagcacag 
1741 gtatcgtgag cacgcgaacg aagatgaacg gctgcactaa agcaaaacag tgcattgcct 
18 01 acaagagcga ccaaacgaaa tgggtcttca actcgccgga tcttattagg cacacagacc 
1861 actcagtgca aggtaaattg cacattccat tccgcttgac accgacagtc tgcccggttc 
1921 cgttagctca cacgcctaca gtcacgaagt ggttcaaagg catcaccctc cacctgactg 



1981 caatgcgacc aacattgctg acaacgagaa 
2 041 aatggattac agggtctaca tccaggaatt 
2101 tatggggtaa ccatgaacca gtcagagtct 
2161 atggatggcc gcatgagatc atcatccact 
2221 ttgtgctgtg tggtgtcgct cttgctatcc 
22 81 tcgccaaagc aagaagagac tgcctgacgc 
2341 ccacagcatt agcggttttg tgctgcattc 
2401 ctttgaacca tctgtggttt aacaaccaac 
2461 tggcagcgct tgttattctg ttccgctgct 
2 521 caggcgtctg cctggggaag gtagacgcct 
2 581 cggggatccc gtataaggcg ttggtcgaac 
2 641 tcacggtcgt ctcatcggaa ttaacacctt 
2701 tccacacagt cattccttca ccacaagtta 
2761 cctcaaaggc ggattacaca tgccgcgttt 
2821 gcgcacaatg cttctgtgac agtgagaaca 
2 881 ctccagactg cactatagat cacgcagtcg 

2 941 tcggcctgcg tatagtatac ggcaacacca 
3001 tcacgccagg ttcctcacgg gacctgaagg 

3 061 caccctttga ccataaggtc gtcatcagaa 
3121 agtatggagc tatgaaacca ggagcgttcg 
3181 cagacatagt agcccgcact gacatacggc 
3241 tcccctacac ccaagcagta tcagggtatg 
33 01 tgcaagaaac agcaccattt ggatgtaaaa 
3361 gtgcttacgg gcacatccct atctcgattg 
3421 cagaatcacc aacaatttta gaagttagct 
3481 actttggtgg ttctctaaca ttacagtaca 
3541 actcccactc cacgacagct gttttgaagg 
3601 gcataacact acattttagc acatcgagcc 
3 661 gcaagaagtc cacctgcaat gctgaatgta 
3 721 cacataaagt cgaccaagaa ttccaggcgg 
3781 ttgcactgtt tgggggagca tcatccctca 
3841 gctctatgct tataaacaca cgtagatgac 
3 901 tcgatgtact tccgaggaag cgtggtgcat 
3 961 gatgtatttc cgaggaagca cagtgcataa 
4021 acactacata ttaacaacac tatatcactt 
4081 cactacacat attttactta aaaacactat 
4141 ctttaggttt 

// 



aattggggct gcgagcagac gcaacagcag 
tttctgtggg gcgagaaggg ctggagtacg 
gggcccagga gtcggcacca ggcgacccac 
attatcatcg gcatccagtc tacactgtca 
tggtaggcac tgcatcatca gcagcttgca 
catacgcgct tgcaccgaac gcaacggtac 
ggccaaccaa cgctgaaaca tttggagaaa 
cgtttctctg ggcacagttg tgcattcctc 
tttcatgctg catgcctttt ttattggttg 
tcgaacatgc gaccactgtg ccaaatgttc 
gcgcaggtta cgcgccactt aacctggaga 
caactaacaa ggagtacgtg acctgcaaat 
aatgctgcgg gtccctcgag tgcaaggcat 
ttggcggtgt gtaccctttc atgtggggag 
cacaactgag tgaggcgtac gtcgagttcg 
cactaaaagt tcacacagct gctctgaaag 
ccgcgcacct ggatacgttt gtcaatggcg 
tcatagcagg gccgatatca gccgcttttt 
aggggcttgt ttacaactac gacttccctg 
gcgatattca agcatcctcg cttgatgcta 
tgctgaagcc ttctgtcaag aacatccacg 
aaatgtggaa gaacaactca ggacgacccc 
ttgaagtgga gcctctgcga gcgtctaact 
acatccctga tgcagctttt gtgagatcat 
gcacagtagc agactgcatt tattctgcag 
aagctgacag ggagggacat tgtccagttc 
aagcgaccac acatgtgact gccgtaggca 
cacaagcaaa ttttatagtt tcgctatgcg 
aaccaccggc cgaccacata attggagaac 
cagtttccaa aacatcttgg aactggctgc 
ttgttgtagg acttatagtg ttggtctgca 
tgagcgcgga cactgacata gcggtaaaac 
aatgccacgc gccgcttgac actaaaactc 
tgctgtgcag tgtcacatta atcgtatatc 
ttatgagact cactatgggt ctctaatata 
acacacttta taaattctct cataatttca 



9-^ 



SEQUENCE LISTING - pVHX-6 - SEQ ID NO 3 



<200> SEQUENCE CHARACTERISTICS 

<210> SEQ ID NO 3 

<211> LENGTH: 12 60 

<212> TYPE: DNA 

<213> ORGANISM: EXAMPLE OF A CMV PROMOTER COUPLED TO THE pcDW-XH7 

SEQUENCE 

<22 0> FEATURE: 

1-3 02 pVAX vector sequence 

3 02-4 01 CMV promoter 
f==i 411 CMV putative transcriptional start site 

g 444-463 T7 promoter 

ry 464-509 pVAX multicloning region 

1^1 510 start of pcDW-HX45 sequence 

SI 

tfi 1 cattgacgtc aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg 

^ 61 tatcatatgc caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat 

121 tatgcccagt acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc 

ry 181 atcgctatta ccatggtgat gcggttttgg cagtacatca atgggcgtgg atagcggttt 

nj 241 gactcacggg gatttccaag tctccacccc attgacgtca atgggagttt gttttggcac 

301 caaaatcaac gggactttcc aaaatgtcgt aacaactccg ccccattgac gcaaatgggc 

p 3 61 ggtaggcgtg tacggtggga ggtctatata agcagagctc tctggctaac tagagaaccc 

p, 421 actgcttact ggcttatcga aattaatacg actcactata gggagaccca agctggctag 

481 cgtttaaact taagcttggt accgagctca tactggcagg cctgatcatc acgtctctgt 

541 ccacgttagc cgaaagcgtt aagaacttca agagcataag agggagccca atcaccctct 

601 acggctgacc taaataggtg acgtagtaga cacgcaccta cccaccgcca gaatgtttcc 

661 ataccctcag ctgaactttc caccagttta ccctacaaat ccgatggctt accgagatcc 

721 aaaccctcct aggcgccgct ggaggccgtt tcggcccccg ctggctgctc aaatcgaaga 

781 tcttaggagg tcgatagtca acttgacttt caaacaacga tcacctaatc cgccgccagg 

841 tccaccgcca aagaagaaga agagtgctcc taagccaaaa act act cage ctaaaaagaa 

901 gaagcagcaa gccaagagga cgaaacgcaa gcctaaacca gggaaacgac aacgtatgtg 

961 tatgaagttg gagtcggaca agacatttcc gatcatgctg aacggccaag tgaatggata 

1021 tgcctgcgtt gtcggaggaa ggctgatgaa accactccac gttgaaggaa aaattgataa 

1081 tgagcaatta gcggccgtga aattgaagaa ggctagcatg tacgacttgg agtacggcga 

1141 cgttccccag aacatgaaat cagacacgct gcagtacacc agcgacaaac caccgggctt 

12 01 ctacaactgg caccacggcg cagtccagta tgagaatggg agatttaccg taccgagagg 

.... CONTINUOUS WITH pcDW-HX45 (pcDW-XH7) SEQUENCE 



4177 gaatgtaaac caccggccga ccacataatt 
4237 caggcggcag tttccaaaac atcttggaac 
4297 tccctcattg ttgtaggact tatagtgttg 
4357 agatgactga gcgcggacac tgacatagcg 
4417 ggtgcataat gccacgcgaa gcttgacact 
4477 tgcataatgc tgtgcagtgt cacattaatc 
4537 atcactttta tgagactcac tatgggtctc 
4597 acactataca cactttataa attcttttat 



ggagaaccac ataaagtcga ccaagaattc 
tggctgcttg cactgtttgg gggagcatca 
gtctgcagct ctatgcttat aaacacacgt 
gtaaaactcg atgtacttcc gaggaagcgt 
aaaactcgat gtatttccga ggaagcacag 
gtatatcaca ctacatatta acaacactat 
taatatacac tacacatatt ttacttaaaa 
aatttttctt ttgctttaga gcacactggc 



4657 ggccgttact agtggatccg agctctagag ggcc 



4177-4643 pcDW-HX45 
4644-4790 vector sequence // 



